A Random Walk-Based Model for Identifying Semantic Orientation
نویسندگان
چکیده
Automatically identifying the sentiment polarity of words is a very important task that has been used as the essential building block of many natural language processing systems such as text classification, text filtering, product review analysis, survey response analysis, and on-line discussion mining. We propose a method for identifying the sentiment polarity of words that applies a Markov random walk model to a large word relatedness graph, and produces a polarity estimate for any given word. The model can accurately and quickly assign a polarity sign and magnitude to any word. It can be used both in a semi-supervised setting where a training set of labeled words is used, and in a weakly supervised setting where only a handful of seed words is used to define the two polarity classes. The method is experimentally tested using a gold standard set of positive and negative words from the General Inquirer lexicon. We also show how our method can be used for three-way classification which identifies neutral words in addition to positive and negative words. Our experiments show that the proposed method outperforms the state-of-the-art methods in the semi-supervised setting and is comparable to the best reported values in the weakly supervised setting. In addition, the proposed method is faster and does not need a large corpus. We also present extensions of our methods for identifying the polarity of foreign words and out-of-vocabulary words.
منابع مشابه
SEIMCHA: a new semantic image CAPTCHA using geometric transformations
As protection of web applications are getting more and more important every day, CAPTCHAs are facing booming attention both by users and designers. Nowadays, it is well accepted that using visual concepts enhance security and usability of CAPTCHAs. There exist few major different ideas for designing image CAPTCHAs. Some methods apply a set of modifications such as rotations to the original imag...
متن کاملA Fuzzy Random Walk Technique to Forecasting Volatility of Iran Stock Exchange Index
Study of volatility has been considered by the academics and decision makers dur-ing two last decades. First since the volatility has been a risk criterion it has been used by many decision makers and activists in capital market. Over the years it has been of more importance because of the effect of volatility on economy and capital markets stability for stocks, bonds, and foreign exchange mark...
متن کاملRefining Image Annotation by Integrating PLSA with Random Walk Model
In this paper, we present a new method for refining image annotation by integrating probabilistic latent semantic analysis (PLSA) with random walk (RW) model. First, we construct a PLSA model with asymmetric modalities to estimate the posterior probabilities of each annotating keywords for an image, and then a label similarity graph is constructed by a weighted linear combination of label simil...
متن کاملIdentifying the semantic orientation of terms using S-HAL for sentiment analysis
0950-7051/$ see front matter 2012 Elsevier B.V. A http://dx.doi.org/10.1016/j.knosys.2012.04.011 ⇑ Corresponding author at: MOE Key Laboratory Network Security, Xi’an Jiaotong University, Xi’an 71 82667964. E-mail addresses: [email protected] (T. Xu (Q. Peng). Sentiment analysis continues to be a most important research problem due to its abundant applications. Identifying the semantic orien...
متن کاملRandom Walk on WordNet to Measure Lexical Semantic Relatedness
The need to determine semantic relatedness or its inverse, semantic distance, between two lexically expressed concepts is a problem that pervades much of natural language processing such as document summarization, information extraction and retrieval, word sense disambiguation and the automatic correction of word errors in text. Standard ways of measuring similarity between two words on a thesa...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Computational Linguistics
دوره 40 شماره
صفحات -
تاریخ انتشار 2014